Particle filter

In statistics, particle filters, also known as Sequential Monte Carlo methods (SMC), are sophisticated model estimation techniques based on simulation.^[1] Particle filters have important applications in econometrics,^[2] and in other fields.

Particle filters are usually used to estimate Bayesian models in which the latent variables are connected in a Markov chain — similar to a hidden Markov model (HMM), but typically where the state space of the latent variables is continuous rather than discrete, and not sufficiently restricted to make exact inference tractable (as, for example, in a linear dynamical system, where the state space of the latent variables is restricted to Gaussian distributions and hence exact inference can be done efficiently using a Kalman filter). In the context of HMMs and related models, "filtering" refers to determining the distribution of a latent variable at a specific time, given all observations up to that time; particle filters are so named because they allow for approximate "filtering" (in the sense just given) using a set of "particles" (differently-weighted samples of the distribution).

Particle filters are the sequential ("on-line") analogue of Markov chain Monte Carlo (MCMC) batch methods and are often similar to importance sampling methods. Well-designed particle filters can often be much faster than MCMC. They are often an alternative to the Extended Kalman filter (EKF) or Unscented Kalman filter (UKF) with the advantage that, with sufficient samples, they approach the Bayesian optimal estimate, so they can be made more accurate than either the EKF or UKF. However, when the simulated sample is not sufficiently large, they might suffer from sample impoverishment. The approaches can also be combined by using a version of the Kalman filter as a proposal distribution for the particle filter.

1 Goal
2 Model
3 Monte Carlo approximation
4 Sequential Importance Resampling (SIR)
5 Sequential Importance Sampling (SIS)
6 "Direct version" algorithm
7 Other Particle Filters
8 See also
9 References
- 9.1 Bibliography
10 External links

Goal

The particle filter aims to estimate the sequence of hidden parameters, x_k for k = 0,1,2,3,…, based only on the observed data y_k for k = 0,1,2,3,…. All Bayesian estimates of x_k follow from the posterior distribution p(x_k | y₀,y₁,…,y_k). In contrast, the MCMC or importance sampling approach would model the full posterior p(x₀,x₁,…,x_k | y₀,y₁,…,y_k).

Model

Particle methods assume $x_k$ and the observations $y_k$ can be modeled in this form:

$x_0, x_1, \ldots$ is a first order Markov process such that

$x_k|x_{k-1} \sim p_{x_k|x_{k-1}}(x|x_{k-1})$

and with an initial distribution $p(x_0)$ .

The observations $y_0, y_1, \ldots$ are conditionally independent provided that $x_0, x_1, \ldots$ are known

In other words, each $y_k$ only depends on $x_k$

$y_k|x_k \sim p_{y|x}(y|x_k)$

One example form of this scenario is

$x_k = g(x_{k-1}) %2B w_k \,$

$y_k = h(x_k) %2B v_k \,$

where both $w_k$ and $v_k$ are mutually independent and identically distributed sequences with known probability density functions and $g(\cdot)$ and $h(\cdot)$ are known functions. These two equations can be viewed as state space equations and look similar to the state space equations for the Kalman filter. If the functions $g(\cdot)$ and $h(\cdot)$ are linear, and if both $w_k$ and $v_k$ are Gaussian, the Kalman filter finds the exact Bayesian filtering distribution. If not, Kalman filter based methods are a first-order approximation (EKF) or a second-order approximation (UKF in general, but if probability distribution is Gaussian a third-order approximation is possible). Particle filters are also an approximation, but with enough particles they can be much more accurate.

Monte Carlo approximation

Particle methods, like all sampling-based approaches (e.g., MCMC), generate a set of samples that approximate the filtering distribution $p(x_k|y_0,\dots,y_k)$ . So, with $P$ samples, expectations with respect to the filtering distribution are approximated by

$\int f(x_k)p(x_k|y_0,\dots,y_k)dx_k\approx\frac1P\sum_{L=1}^Pf(x_k^{(L)})$

and $f(\cdot)$ , in the usual way for Monte Carlo, can give all the moments etc. of the distribution up to some degree of approximation.

Sequential Importance Resampling (SIR)

Sequential importance resampling (SIR), the original particle filtering algorithm (Gordon et al. 1993), is a very commonly used particle filtering algorithm, which approximates the filtering distribution $p(x_k|y_0,\ldots,y_k)$ by a weighted set of P particles

$\{(w^{(L)}_k,x^{(L)}_k)~:~L\in\{1,\ldots,P\}\}$ .

The importance weights $w^{(L)}_k$ are approximations to the relative posterior probabilities (or densities) of the particles such that $\sum_{L=1}^P w^{(L)}_k = 1$ .

SIR is a sequential (i.e., recursive) version of importance sampling. As in importance sampling, the expectation of a function $f(\cdot)$ can be approximated as a weighted average

$\int f(x_k) p(x_k|y_0,\dots,y_k) dx_k \approx \sum_{L=1}^P w^{(L)} f(x_k^{(L)}).$

For a finite set of particles, the algorithm performance is dependent on the choice of the proposal distribution

$\pi(x_k|x_{0:k-1},y_{0:k})\,$ .

The optimal proposal distribution is given as the target distribution

$\pi(x_k|x_{0:k-1},y_{0:k}) = p(x_k|x_{k-1},y_{k}). \,$

However, the transition prior is often used as importance function, since it is easier to draw particles (or samples) and perform subsequent importance weight calculations:

$\pi(x_k|x_{0:k-1},y_{0:k}) = p(x_k|x_{k-1}). \,$

Sequential Importance Resampling (SIR) filters with transition prior as importance function are commonly known as bootstrap filter and condensation algorithm.

Resampling is used to avoid the problem of degeneracy of the algorithm, that is, avoiding the situation that all but one of the importance weights are close to zero. The performance of the algorithm can be also affected by proper choice of resampling method. The stratified sampling proposed by Kitagawa (1996) is optimal in terms of variance.

A single step of sequential importance resampling is as follows:

1) For $L=1,\ldots,P$ draw samples from the proposal distribution

$x^{(L)}_k \sim \pi(x_k|x^{(L)}_{0:k-1},y_{0:k})$

2) For $L=1,\ldots,P$ update the importance weights up to a normalizing constant:

$\hat{w}^{(L)}_k = w^{(L)}_{k-1} \frac{p(y_k|x^{(L)}_k) p(x^{(L)}_k|x^{(L)}_{k-1})} {\pi(x_k^{(L)}|x^{(L)}_{0:k-1},y_{0:k})}.$

Note that when we use the transition prior as the importance function, $\pi(x_k^{(L)}|x^{(L)}_{0:k-1},y_{0:k}) = p(x^{(L)}_k|x^{(L)}_{k-1})$ , this simplifies to the following :

$\hat{w}^{(L)}_k = w^{(L)}_{k-1} p(y_k|x^{(L)}_k),$

3) For $L=1,\ldots,P$ compute the normalized importance weights:

$w^{(L)}_k = \frac{\hat{w}^{(L)}_k}{\sum_{J=1}^P \hat{w}^{(J)}_k}$

4) Compute an estimate of the effective number of particles as

$\hat{N}_\mathit{eff} = \frac{1}{\sum_{L=1}^P\left(w^{(L)}_k\right)^2}$

5) If the effective number of particles is less than a given threshold $\hat{N}_\mathit{eff} < N_{thr}$ , then perform resampling:

a) Draw $P$ particles from the current particle set with probabilities proportional to their weights. Replace the current particle set with this new one.

b) For $L=1,\ldots,P$ set $w^{(L)}_k = 1/P.$

The term Sampling Importance Resampling is also sometimes used when referring to SIR filters.

Sequential Importance Sampling (SIS)

Is the same as Sequential Importance Resampling, but without the resampling stage.

"Direct version" algorithm

The "direct version" algorithm is rather simple (compared to other particle filtering algorithms) and it uses composition and rejection. To generate a single sample $x$ at $k$ from $p_{x_k|y_{1:k}}(x|y_{1:k})$ :

1) Set n=0 (This will count the number of particles generated so far)

2) Uniformly choose an index L from the range $\{1,..., P\}$

3) Generate a test $\hat{x}$ from the distribution $p_{x_k|x_{k-1}}(x|x_{k-1|k-1}^{(L)})$

4) Generate the probability of $\hat{y}$ using $\hat{x}$ from $p_{y|x}(y_k|\hat{x})$ where $y_k$ is the measured value

5) Generate another uniform u from $[0, m_k]$

6) Compare u and $p\left(\hat{y}\right)$

6a) If u is larger then repeat from step 2

6b) If u is smaller then save $\hat{x}$ as $x_{k|k}^{(p)}$ and increment n

7) If n == P then quit

The goal is to generate P "particles" at $k$ using only the particles from $k-1$ . This requires that a Markov equation can be written (and computed) to generate a $x_k$ based only upon $x_{k-1}$ . This algorithm uses composition of the P particles from $k-1$ to generate a particle at $k$ and repeats (steps 2-6) until P particles are generated at $k$ .

This can be more easily visualized if $x$ is viewed as a two-dimensional array. One dimension is $k$ and the other dimensions is the particle number. For example, $x(k,L)$ would be the L^th particle at $k$ and can also be written $x_k^{(L)}$ (as done above in the algorithm). Step 3 generates a potential $x_k$ based on a randomly chosen particle ( $x_{k-1}^{(L)}$ ) at time $k-1$ and rejects or accepts it in step 6. In other words, the $x_k$ values are generated using the previously generated $x_{k-1}$ .

Other Particle Filters

Auxiliary particle filter
Gaussian particle filter
Unscented particle filter
Monte Carlo particle filter
Gauss-Hermite particle filter
Cost Reference particle filter
Rao-Blackwellized particle filter

References

^ Doucet, A.; De Freitas, N.; Gordon, N.J. (2001). Sequential Monte Carlo Methods in Practice. Springer.
^ Thomas Flury & Neil Shephard, 2008. "Bayesian inference based only on simulated likelihood: particle filter analysis of dynamic economic models," OFRC Working Papers Series 2008fe32, Oxford Financial Research Centre.

Bibliography

Cappe, O.; Moulines, E.; Ryden, T. (2005). Inference in Hidden Markov Models. Springer.
Liu, J. (2001). Monte Carlo strategies in Scientific Computing. Springer.

Ristic, B.; Arulampalam, S.; Gordon, N. (2004). Beyond the Kalman Filter: Particle Filters for Tracking Applications. Artech House.

Doucet, A.; Johansen, A.M.; (December 2008). "A tutorial on particle filtering and smoothing: fifteen years later". Technical report, Department of Statistics, University of British Columbia. http://www.cs.ubc.ca/%7Earnaud/doucet_johansen_tutorialPF.pdf.

Doucet, A.; Godsill, S.; Andrieu, C.; (2000). "On sequential Monte Carlo sampling methods for Bayesian filtering". Statistics and Computing 10 (3): 197–208. doi:10.1023/A:1008935410038. http://www.springerlink.com/content/q6452k2x37357l3r/.

Arulampalam, M.S.; Maskell, S.; Gordon, N.; Clapp, T.; (2002). "A tutorial on particle filters for online nonlinear/non-Gaussian Bayesian tracking". IEEE Transactions on Signal Processing 50 (2): 174–188. doi:10.1109/78.978374. http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=978374.

Cappe, O.; Godsill, S.; Moulines, E.; (2007). "An overview of existing methods and recent advances in sequential Monte Carlo". Proceedings of IEEE 95 (5): 899. doi:10.1109/JPROC.2007.893250.

Kitagawa, G. (1996). "Monte carlo filter and smoother for non-Gaussian nonlinear state space models". Journal of Computational and Graphical Statistics (Journal of Computational and Graphical Statistics, Vol. 5, No. 1) 5 (1): 1–25. doi:10.2307/1390750. JSTOR 1390750.

Kotecha, J.H.; Djuric, P.; (2003). "Gaussian Particle filtering". IEEE Transactions Signal Processing 51 (10).

Haug, A.J. (2005). "A Tutorial on Bayesian Estimation and Tracking Techniques Applicable to Nonlinear and Non-Gaussian Processes". The MITRE Corporation, USA, Tech. Rep., Feb. http://www.mitre-corporation.net/work/tech_papers/tech_papers_05/05_0211/05_0211.pdf. Retrieved 2008-05-06.

Pitt, M.K.; Shephard, N. (1999). "Filtering Via Simulation: Auxiliary Particle Filters". Journal of the American Statistical Association (Journal of the American Statistical Association, Vol. 94, No. 446) 94 (446): 590–591. doi:10.2307/2670179. JSTOR 2670179. http://www.questia.com/PM.qst?a=o&se=gglsc&d=5002321997. Retrieved 2008-05-06.

Gordon, N. J.; Salmond, D. J. and Smith, A. F. M. (1993). "Novel approach to nonlinear/non-Gaussian Bayesian state estimation". IEEE Proceedings F on Radar and Signal Processing 140 (2): 107–113. doi:10.1049/ip-f-2.1993.0015. http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=210672. Retrieved 2009-09-19.

Chen, Z. (2003). Bayesian Filtering: From Kalman Filters to Particle Filters, and Beyond. http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.107.7415&rep=rep1&type=pdf. Retrieved 2010-10-14.

External links

Feynman-Kac models and interacting particle algorithms (a.k.a. Particle Filtering) Theoretical aspects and a list of application domains of particle filters
Sequential Monte Carlo Methods (Particle Filtering) homepage on University of Cambridge
Dieter Fox's MCL Animations
Rob Hess' free software
SMCTC: A Template Class for Implementing SMC algorithms in C++
Tutorial on particle filtering with the MRPT C++ library, and a mobile robot localization video.
Java applet on particle filtering
An example of human tracking using Particle filters, c++ code is also available for download.